Interactive Clustering with a High-Performance ML Toolkit

نویسندگان

  • Biye Jiang
  • John Canny
چکیده

Clustering is a class of machine learning algorithms which has important applications in many different fields. Users often use clustering to find hidden structures from data for those domain specific problems. However, evaluating clustering results is always a hard problem. In many and perhaps most of these applications, users need to trade off competing goals and encode prior knowledge into the model to define what is the best result. The learning algorithm however has evolved around the optimization of a single, usually narrowly-defined criterion, which may not obtain satisfactory results. In most cases, an expert makes trade-offs between different criteria which requires high-level (human) intelligence. This motivates us to provide interactive customization and optimization so that the expert can incorporate secondary criteria into the model-generation process in an interactive way. In this demo paper we will demonstrate the techniques we developed to do customized and interactive model optimization for clustering algorithms. The keys to the approach are (i) high-performance training so that non-trivial models can be trained in real-time (using roofline design and GPU hardware), (ii) a machine learning architecture which is modular, and supports primary and secondary loss functions, and (iii) highly-interactive visualization tools that support dynamic creation of visualizations and controls to match the bespoke criteria being optimized.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Application Experiences with the Globus

The development of applications and tools for high-performance \computational grids" is complicated by the heterogeneity and frequently dynamic behavior of the underlying resources; by the complexity of the applications themselves, which often combine aspects of supercomputing and distributed computing; and by the need to achieve high levels of performance. The Globus toolkit has been developed...

متن کامل

Application Experiences with the Globus Toolkit

The development of applications and tools for highperformance “computational grids” is complicated by the heterogeneity and frequently dynamic behavior of the underlying resources; by the complexity of the applications themselves, which often combine aspects of supercomputing and distributed computing; and by the need to achieve high levels of performance. The Globus toolkit has been developed ...

متن کامل

Design of an Application Development Toolkit for HPF / Fortran 90 D

The development of eecient application software capable of exploiting available High Performance Computing (HPC) systems is non-trivial and is largely governed by the availability of suuciently high-level languages, tools, and application development environments. In this paper we describe the design and operation of a toolkit for HPF/Fortran 90D application development. The toolkit incorporate...

متن کامل

Interactive Form-Generation in High-Performance Architecture Theory

Architecture as a designerly way of thinking and knowing is to interact with its environment. The manuscript is to speculate “interactive form-generation” based on high-performance architecture theory, and discuss the precursors and the potentials. The research aims to explore and determine the roots, aspects of interactive architecture as a part of performance-based design in contemporary arch...

متن کامل

Feasibility of using Medical Imaging Interaction Toolkit in volumetric studies to accurate diagnosing of vascular emboli by Extended NURBS-based Cardiac-Torso phantom

Introduction: Important complications of venous thromboembolism (VTE) are a longer hospital stay, readmission, recurrence of the emboli, complications of anticoagulant therapy and death in a sever condition. In present study, the volume measurement accuracy of the medical imaging interaction toolkit (MITK) software on determining VTE in computed tomography images was evaluated....

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015